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ABSTRACT 

Toward the expansion of the genetic alphabet of 
DNA, we present highly efficient unnatural base 
pair systems as an artificial third base pair for 
PCR. Hydrophobic unnatural base pair systems 
between 7-(2-thienyl)imidazo[4,5-b]pyridine (Ds) 
and 2-nitro-4-propynylpyrrole (Px) were fine-tuned 
for efficient PCR, by assessing the amplification ef- 
ficiency and fidelity using different polymerases and 
template sequence contexts and modified Px bases. 
Then, we found that some modifications of the Px 
base reduced the misincorporation rate of the un- 
natural base substrates opposite the natural bases 
in templates without reducing the Ds-Px pairing se- 
lectivity. Under optimized conditions using Deep 
Vent DNA polymerase, the misincorporation rate 
was extremely low (0.005%/bp/replication), which 
is close to that of the natural base mispairings by 
the polymerase. DNA fragments with different 
sequence contexts were amplified ~10 10 -fold by 40 
cycles of PCR, and the selectivity of the Ds-Px 
pairing was >99.9%/replication, except for 99.77%/ 
replication for unfavorable purine-Ds-purine motifs. 
Furthermore, >97% of the Ds-Px pair in DNA 
survived in the 10 28 -fold amplified products after 
100-cycle PCR (10 cycles repeated 10 times). This 
highly specific Ds-Px pair system provides a frame- 
work for new biotechnology. 

INTRODUCTION 

Expansion of the genetic alphabet of DNA with artificial 
extra base pairs (unnatural base pairs) would have great 
potential for advancing the present genetic recombination 
systems to next generation biotechnologies. The genetic 



expansion by unnatural base pair systems could facilitate 
the site-specific incorporation of extra components into 
nucleic acids and proteins at desired positions, through 
the genetic information flow process comprising replica- 
tion, transcription and translation (1,2). Researchers have 
chemically synthesized many unnatural base pairs and 
tested their abilities, and some of them function as a 
third base pair in in vitro biological processes (3-9). In 
addition, the attachment of functional groups to unnat- 
ural bases enables the site-specific functionalization of 
DNA and RNA molecules by replication and transcrip- 
tion (10-16). The capabilities of several unnatural base 
pairs have been demonstrated, and they have been 
utilized in diagnostics for target DNA detection and in 
basic research for site-specific labeling of nucleic acid mol- 
ecules (9,17-21). For example, an unnatural base pair 
between isoguanine and isocytosine (22) is available as 
Plexor, in a multiplex real-time quantitative PCR detec- 
tion platform (23-25). 

Toward the next generation of biotechnology, the de- 
velopment of unnatural base pairs for specific PCR amp- 
lification is a seminal milestone, to allow the large-scale 
amplification of synthetic DNA fragments containing un- 
natural base pairs as a supply of DNA templates for use in 
further biological processes. Since the amplified DNA 
fragments are reused as templates in the following PCR 
amplification, high fidelity of the cognate base pairing is 
essential. If the selectivity of an unnatural base pairing, 
which is indicated by the incorporation selectivity (%) of 
the unnatural base substrates opposite their pairing 
partners in templates, is 99. 9% /replication, then the 
amplified DNA would retain 96% of the unnatural base 
pair after 40 cycles of PCR (0.999 40 = 0.961) in a theoret- 
ical exponential amplification. However, 99.0% selectivity 
of the unnatural base pairing would reduce the retention 
to 67% in the amplified products after 40 cycles of PCR 
(0.990 40 = 0.669). Furthermore, for the high fidelity of un- 
natural base pairing, extremely low misincorporation rates 
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of unnatural base substrates opposite the natural bases in 
templates are also required. Even if the misincorporation 
rate is 0.10%/natural bp/replication, up to 33% of the 
natural base pairs in a 10-base sequence flanked by PCR 
primers would be mutated to unnatural base pairs after 40 
cycles (0.33 = 1 - [(1 - 0.10/100) 10 ] 40 ) in a theoretical ex- 
ponential amplification. Thus, for the further development 
of unnatural base pair systems in PCR, the base pairing 
fidelity of both the high incorporation selectivity in 
cognate base pairings and the low misincorporation rate 
of unnatural bases opposite the natural bases, as well as 
the high amplification efficiency, are required to facilitate 
the advancement of the new biotechnology. 

So far, various unnatural base pairs that function in 
PCR with 96—99.8% incorporation selectivity per replica- 
tion or per PCR cycle have been reported (26-35), 
although the determination of the fidelity of unnatural 
base pairing depends on each researcher's method. 
Among them, we developed a hydrophobic unnatural 
base pair between 7-(2-thienyl)-imidazo[4,5-Z?]pyridine 
(Ds) and 2-nitro-4-propynylpyrrole (Px) for efficient 
PCR amplification (32). The Ds-Px pair was designed 
by strict shape-complementarity of the pairing surface, 
but both shapes are different from those of the natural 
bases. In addition, the Ds-Px pair lacks notable 
hydrogen bonds between the pairing bases, and thus the 
hydrophobic bases effectively exclude the non-cognate 
pairing with the natural bases. Furthermore, the Px base 
has the nitro group to electrostatically repel the mispairing 
with A (31). The oxygen of the nitro group of Px and the 
nitrogen of Ds, corresponding to the N3 position of the 
natural purine bases, are also important as proton 
acceptor residues, to interact with polymerases for effi- 
cient replication. By modifying a conventional sequencing 
method for the unnatural base pair, we determined the 
incorporation selectivity (> 99% /replication) of the Ds- 
Px pairing in PCR with Deep Vent DNA polymerase, 
possessing a 3' —> 5 f exonuclease activity (exo + ). In 
addition, we identified the efficiencies of the template 
sequence contexts of the natural bases around the unnat- 
ural base in PCR amplification. For example, in vitro se- 
lection experiments using a DNA library containing the 
unnatural base pair (32) and primer extension experiments 
using templates all combinations of 16 sequences of 
3 / -NDsN-5 / (N = A, G, C or T) revealed that the less ef- 
ficient sequences contain 5 / -YPxY-3 / /3 / -RDsR-5 / patterns 
(Morohashi, Kimoto, and Hirao, unpublished data), such 
as 5 / -GTTPxCAT-3 / /3 / -CAADsGTA-5 / (32), in PCR 
amplification. However, the exact differences among 
these sequence variants, in terms of the efficiency and 
fidelity of the Ds-Px pairing in PCR, still require clarifica- 
tion, and its PCR conditions can be further optimized. 
Through the process of unnatural base pair development, 
we have expected to discover other potentially important 
parameters of the Ds-Px pair in PCR amplification. 

Here, we present highly efficient, high-fidelity PCR 
systems involving the Ds-Px pair with optimized 
conditions for a variety of template sequence contexts. 
For this, we developed accurate determination methods 
for the fidelity of both the incorporation selectivity of 
the Ds-Px pairing and the misincorporation rates of the 



natural base replacement with the unnatural bases in 
DNA templates during PCR, as well as the amplification 
efficiency, by repeating 10 cycles of PCR to stay within 
exponential amplification. The Ds-Px pair survived 
through 100 cycles of PCR, and after 100 cycles of PCR 
(10 cycles of PCR repeated 10 times) under the optimized 
conditions, the DNA fragment containing the Ds-Px pair 
was amplified 10 28 -fold, and >97% of the Ds-Px pair was 
retained in the amplified products. Accordingly, the un- 
natural base pair selectivity was as high as 99.97% /repli- 
cation. Furthermore, we found that some functional 
groups attached to the Px base reduced the 
misincorporation rates of the unnatural base substrate 
opposite the natural bases in templates, without 
reducing the amplification efficiency and the incorpor- 
ation selectivity of the unnatural base pairing between 
Ds and modified Px. This Ds-Px pair system could be 
practically used for future genetic alphabet expansion 
technologies. 

MATERIALS AND METHODS 

General 

DNA templates containing Ds were synthesized with an 
Applied Biosystems 392 DNA synthesizer, using CE 
phosphoramidite reagents for the natural and Ds bases 
(Glen Research). DNA fragments comprising only the 
natural bases were synthesized or purchased from 
Invitrogen. The primer labeled with FAM at the 5'-end 
was purchased from Invitrogen. DNA fragments were 
purified by gel electrophoresis. The syntheses of dDsTP, 
NH 2 -hx-dPxTP and Cy5-hx-dPxTP was described previ- 
ously (30,32,36), and the syntheses of the other 
modified-dPxTPs are described in the Supplementary 
Data section. The following thermostable DNA polymer- 
ases were purchased: Deep Vent (exo + ), Deep Vent (exo~), 
Vent (exo + ), N9°, Phusion HF and Taq DNA pols from 
New England Biolabs; AccuPrime Pfx™ and Pfx50 DNA 
pols from Invitrogen, Pfu DNA pol from Promega; Pwo 
SY DNA pol from Roche; TITANIUM Taq DNA pol 
from Clontech. 

PCR involving the Ds-Px pairing 

PCR amplification (25 jil) of the 55-mer DNA fragments 
^- TTTCACACAGGAAACAGCTATGAC GG-Ny-CC 
CTATAGTGAGTCGTATTATC -3'; N 7 = 5'- 

ATCDsTAT-3' for Ds-temp 1, 5 / -CCCDsTTG-3 / for 
Ds-temp 2, 5 / -TACDsGTG-3 / for Ds-temp 3, 
S'-ATGDsAAC^' for Ds-temp 4 and 5 / -ATCCATT-3 / 
for Cont-temp) was performed by using 1 uM each 
5'-primer (40-mer; 5'-CGTTGTAAAACGACGGCCAG 
GATAATACGACTCACTATAG-30 and 3 / -primer 
(24-mer; 5 / -TTTCACACAGGAAACAGCTATGAC-3 / ) 
and each DNA pol at the manufacturer's recommended 
concentration (Deep Vent (exo + ), 0.5 U; Deep Vent 
(exo~), 0.5 U; AccuPrime Pfx, 1.25 U; Pfx50, 2.5 U; 
Phusion HF, 0.5 U; Pfu, 0.625 U; Pwo SY, 1.25 U; Vent 
(exo + ), 0.5 U; N9°, 0.5 U; TITANIUM Taq, lx; Taq, 
0.5 U), in the lx reaction buffer accompanying each 
DNA pol, supplemented with 50 uM each dDsTP and 
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modified-dPxTP, and 300 uM natural dNTPs, unless 
otherwise indicated. As for PCR with the AccuPrime 
Pfx DNA pol with final concentrations of 400 uM 
natural dNTPs, 100 uM natural dNTPs and 0.5 mM 
MgS0 4 were further added to the lx AccuPrime Pfx 
reaction mix, originally containing 300 uM natural 
dNTPs and 1 mM MgS0 4 . As for PCR with Deep Vent 
DNA pol (exo + ) with final concentrations of 500 uM 
natural dNTPs, 2 mM MgS0 4 was further added to the 
1 x ThermoPol reaction buffer originally containing 2 mM 
MgS0 4 . PCR conditions were 30 s at 94°C, 30 s at 45°C 
and 4 min at 65°C/cycle. 

To screen DNA pols, we performed a 15-cycle PCR, by 
using 0.5 nM 55-mer DNA template and FAM-labeled 
3 / -primer. The amplified PCR products, together with a 
FAM-labeled S'-primer (1 uM) as a marker, were analyzed 
by denaturing 15% polyacrylamide gel electrophoresis 
(PAGE). The full-length 75-mer DNA fragments on the 
gel were quantified by a bio-imaging analyzer LAS-4000 
(FUJIFILM) in the SYBR-mode, and by comparing the 
band intensities with that of the marker, the amplification 
folds were determined. For the sequencing analysis of the 
amplified fragments to determine the retention rates of the 
Ds-Px pair, as described below, PCR was performed by 
using non-labeled primers, and the full-length PCR 
products were purified by denaturing 8% PAGE. 

In the 40- and 100-cycle PCR amplifications, we 
repeated a 10-cycle PCR 4 or 10 times, to maintain expo- 
nential amplification conditions. In the first 10-cycle PCR, 
we used 2nM DNA template, and in the following 
10-cycle PCR amplifications, the amplified products were 
diluted 80- or 1 50-fold (5 ul) into a fresh PCR reaction mix 
(final dilutions: x750 for PCR with Ds-temp 1, Ds-temp 3, 
and Cont-temp and x400 for PCR with Ds-temp 4 for all 
substrates, except for x400 for PCR with Ds-temp 1 for 
BPh-dPxTP or HBP-hx-dPxTP). To determine the fold 
amplification, the amplified PCR products after each 
10-cycle PCR were analyzed by denaturing 15% PAGE, 
and the full-length 75-mer DNA fragments on the gel were 
quantified by the LAS-4000 analyzer after SYBR Green II 
staining. To determine the amplification fidelity, the 
amplified products after 10, 20, 40, 70 and 100 cycles of 
PCR were fractionated on an 8% denaturing gel, followed 
by an analysis of the retention rates of the unnatural base 
by sequencing (for Ds- temps 1, 3 and 4) and the 
misincorporation rate of the unnatural bases opposite 
the natural bases (for Cont-temp), as described below. 

DNA sequencing 

The cycle sequencing reaction (20 ul) was performed 
with the Cycle Sequencing mix (8 ul) from a BigDye 
Terminator vl.l Cycle Sequencing Kit (Applied 
Biosystems), containing 0.3 pmol of DNA fragments and 
4 pmol of Sequencing primer (20-mer; 5'-CGTTGTAAAA 
CGACGGCCAG-3), in the presence of 4-propynyl-l-((3- 
D-ribofuranosyl)pyrrole-2-carbaldehyde S'-triphosphate 
(dPa'TP) (40 pmol for Ds-temp 1 and 1 nmol for Ds- 
temp 2 and Ds-temp 3) or 1 nmol ddPa'TP. The 
sequencing cycling parameters were 25 cycles of 10 s at 
96°C, 5 s at 50°C and 4 min at 60°C. The residual dye 



terminator was removed by passage through a 
CENTRI-SEP column (Princeton Separations), and the 
solutions were dried. The residues were resuspended in 
4ul formamide solution and analyzed by an ABI 377 
DNA Sequencer, using a 6% polyacrylamide-6 M urea 
gel. The sequence data were analyzed with the Applied 
Biosystems PRISM sequencing analysis v3.2 software. 
The retention rates of the Ds-Px pair in the amplified 
PCR products were determined by comparing the 
sequencing patterns with the authentic sequencing 
patterns (see Supplementary Figures SI— SI 5). 

Real-time qPCR 

The PCR reaction (25 ul) was performed in 1 x ThermoPol 
reaction buffer with 300 uM each natural dNTP, 50 uM 
dDsTP and NH 2 -hx-dPxTP, 1 uM each 5 ; -primer and 
3'-primer, 30 000-fold diluted SYBR Green I (Lonza), 
1000-fold diluted ROX (Invitrogen) as a reference dye, 
0.5 unit of Deep Vent (exo + ) DNA pol, and 
DNA template (Ds-temp 1, Ds-temp 4 or Cont-temp) at 
different concentrations (final concentrations: 20 pM, 
2pM, 200 fM and 20 fM) on a Mx3005P Real-time 
system (Agilent Technologies). Data collection was per- 
formed with the detection filters for SYBR and ROX, 
and the data were analyzed with the Mx3005P v4.10 
software. The cycle thresholds were automatically set by 
the software, which gave the cycle threshold values (Ct). 

Analysis of misincorporation rates of unnatural bases into 
natural bases 

We first performed a 40- or 100-cycle PCR in the presence 
of modified-dPxTP by using Cont-temp as described 
earlier (first PCR), and the full-length amplified products 
after 10, 20, 40, 70 and 100 cycles of PCR were purified by 
denaturing 8% PAGE. The purified products (0.25 pmol) 
were subjected to seven extra cycles of PCR with the same 
concentration of the same DNA polymerase, but with 
Cy5-hx-dPxTP, instead of the modified-dPxTP (second 
PCR), to detect the modified Px base positions in the 
amplified DNA fragments. The conditions used in the 
second PCR were 1 uM each S'-primer and FAM-labeled 
3 / -primer, 50 uM dDsTP, 5uM Cy5-hx-dPxTP and 
300 uM natural dNTPs and 2 mM MgS0 4 for the Deep 
Vent DNA pol (exo + ) or 400 uM natural dNTPs and 
1.5 mM MgS0 4 for the AccuPrime Pfx DNA pol. We 
also performed a 7-cycle PCR directly by using lOnM 
(0.25 pmol) Ds-temp 1 and Cont-temp, to normalize the 
correct incorporation of Cy5-hx-dPxTP opposite Ds in 
the template during the second PCR and to subtract the 
background misincorporation of Cy5-hx-dPxTP during 
the second PCR, respectively. Under these conditions, 
we assumed that almost all of the Ds-Px pairs were 
retained after PCR amplification, since we confirmed the 
complete retention of the Ds-Px pair by a sequencing 
analysis of the amplified products of the 55-mer DNA 
template in the second PCR (data not shown). The 
second PCR products, together with a 75-mer 
single-stranded DNA (1 uM) as a marker, were analyzed 
by 15% denaturing PAGE. The fluorescent intensity of 
the full-length 75-mer DNA fragments on the gel 
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was quantified with a bio-imager, FLA-7000 
(GE-Healthcare) in the Cy5-mode and FAM-mode. 

Misincorporations of the unnatural bases into the first 
PCR-amplified DNAs were calculated as follows. First, 
the amounts of the full-length amplified products for 
each sample (A^ t [ sa mpie]) 5 as double-stranded DNA, were 
determined from the FAM intensity. Next, the Cy5 inten- 
sity for each sample (/cy 5 [sample]) was normalized to the 
amounts of the DNA (/ C y5[sam P ie]/N t [sam P ie]). The 
background misincorporation ratio was then sub- 
tracted from the relative Cy5 intensity (/cy5[sam P ie]/ 

^[sample] — ^Cy5[Cont-temp]/^t[Cont-temp])- Finally, each 

sample's misincorporation rate (M%) of the unnatural 
bases into the total amplified DNAs was calculated 
by the following formula: M(%) = (7c y 5[sampie]/ 

^t[sample] — ^Cy5[Cont-temp]/^t[Cont-temp])/(^Cy5[Ds-temp l]/^t[Ds- 
temp l]-^Cy5[Cont-temp]/^ t [Cont-temp]) X 100. We performed 

three independent experiments, and the obtained values 
were averaged. To calculate the misincorporation rates 
per bp per replication (F%), the misincorporation rates 
(M%) were plotted over the replication times [ = log 2 
(Amplification fold)], and were fitted by linear regression 



with a high correlation coefficient (R > 0.99). The 
slope (S%) of the regression line corresponds to 
the averaged misincorporation rate per replication (%), 
and the misincorporation rate per bp per replication 
(F%) was calculated by the following formula: 
F= [l-(l-S/100) 1/u ]x 100. 

RESULTS 

PCR conditions for the Ds-Px pairing 

We first examined the PCR conditions by using several 
DNA polymerases (DNA pols) and unnatural and 
natural base substrate combinations of dDsTP, NH 2 - 
hx-dPxTP (Figure 1) and natural base dNTPs at different 
concentrations. In addition, to determine the natural 
base sequence dependency around the unnatural base 
pair, we used four single-stranded Ds-containing DNA 
templates (55-mer, Ds-temps 1, 2, 3 and 4), as well as a 
control template (Cont-temp) comprising only the natural 
bases (Figure 2A). Our previous studies indicated that 
Ds-temp 1 containing S'-TATDsCTA^, Ds-temp 2 




N 



f 
N 




•H-N 




CH 3 



o 



O — H—N 



J-H---0 



V 



G H 





R 














— H — NH 2 




H 

-N 


o 

II 

-C— (CH 2 ) 5 -NH 2 


dPxTP NH 2 -dPxTP 


NH 2 -hx-dPxTP 


r 








^OH 






(CH 2 ) 2 - 


0-(CH 2 ) 3 /^ 0 H 


DioM-dPxTP 




Diol3o3-dPxTP 


O 

H II H 

— N-C-(CH 2 ) 5 -N 


o 
II 

-c- 


Ph 

-< 






DMP-dPxTP 


Ph 






O 

H II H 
— N-C-(CH 2 ) 5 -N 


0 
II 

-c- 


~\_ 


} 




NTP-dPxTP 


v_ 


) 




O 

H II H 
— N-C-(CH 2 ) 5 -N 


o 

II 

-c- 




J 




BPh-dPxTP 








o 

H II H 

— N-C-(CH 2 ) 5 -N 


0 


< 


> 




HBP-dPxTP 








O O 
H II H II 
— N-C-(CH 2 ) 5 -N-C 


-(CH 2 ) 5 


-nT 


Cy5-hx-dPxTP 


































so 3 - 



Figure 1. Chemical structures of the unnatural Ds-Px pair and the natural A-T and G-C pairs. Functional groups (R), attached to the Px base via 
the propynyl linker, used in this study are summarized in the enclosed box. 
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As. 



Ds-temp 1: 3'-TATDsCTA-5' 
Ds-temp 2: 3'-GTTDsCCC-5' 
Ds-temp 3: 3'-GTGDsCAT-5' 
Ds-temp 4: 3'-CAADsGTA-5' 
Cont-temp: 3'-TTACCTA-5' 
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Figure 2. PCR amplification involving the Ds-Px pairing with various DNA templates containing Ds for the determination of PCR conditions. 
(A) Scheme for PCR amplification experiments using 55-mer DNA fragments containing one Ds base, in the presence of dDsTP, NH 2 -hx-dPxTP and 
natural dNTPs. (B-D) Analyses by denaturing gel electrophoresis of PCR-amplified products after 15 cycles of PCR under different conditions by 
Deep Vent DNA pol (exo + ) (B) and AccuPrime Pfx DNA pol (C,D), for the determination of the fold amplification at the end point. 



containing S'-GTTDsCCC-S', and Ds-temp 3 containing 
3 / -GTGDsCAT-5 / are favorable templates, but Ds-temp 4 
containing 3 / -CAADsGTA-5 / (purine-Ds-purine) is an un- 
favorable one for efficient PCR amplification involving 
the Ds-Px pair by Deep Vent DNA polymerase (exo + ) 
(32). In this study, we incorporated only one unnatural 
base into the DNA templates, to focus on the accurate 
characterization of the Ds-Px pairing. To reduce the 
chemical DNA synthesis error, we utilized 55-mer 
single-stranded DNA templates, and the total base 
lengths were extended by PCR using 40-mer and 24-mer 
PCR primers to facilitate DNA sequencing directly after 
PCR amplification, which converted the templates into 
double-stranded 75-mer DNA products. Throughout all 
of the experiments in the study, we employed a 
PCR cycle program involving 94°C for 30 s, 45°C for 
30 s and 65°C for 4 min. After 15 cycles of PCR, the 
fold amplification of the PCR products (Supplementary 
Table SI) was determined by polyacrylamide gel electro- 
phoresis (representative data are shown in Figure 2B-D). 
Besides the full-length PCR products, truncated products 
paused after the unnatural base incorporations were also 
observed on the gel. The unconsumed primers were par- 
tially degraded by the exonuclease activity of the polymer- 
ases during PCR. 



The retention rates of the Ds-Px pair in the amplified 
DNA were determined by two sequencing methods of the 
Ds-containing DNA strands using conventional dideoxy 
dye-terminator sequencing, but either with a deoxyribo- 
nucleoside triphosphate of 4-propynylpyrrole-2- 
carbaldehyde (dPa'TP), another pairing partner of Ds, 
or with its dideoxyribonucleoside triphosphate 
(ddPa'TP) (30-32). Since no dye-terminator correspond- 
ing to the unnatural base is present in the sequencing, 
there is a gap at the unnatural base position on the 
sequencing pattern in the presence of dPa'TP. In 
another sequencing method in the presence of ddPa'TP, 
the sequencing reaction is stopped at the unnatural base 
position by incorporating ddPaTP opposite Ds in DNA 
templates, and thus the following base peaks disappear. 
However, if some mutation from an unnatural to natural 
base occurs at the original unnatural base position during 
PCR, then the height of each following base peak in- 
creases, depending on the mutation rate in the DNA 
samples. The sequencing with dPa'TP is useful for con- 
firming the unnatural base position, while that with 
ddPa'TP is appropriate for determining the unnatural 
base retention rate at the original position in the tem- 
plates, by comparing the peak heights on the sequencing 
patterns with those obtained using standard authentic 
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Figure 3. Sequencing analysis of the PCR products after 15 cycles of PCR by each DNA polymerase: (A) Deep Vent DNA pol (exo + ); 
(B) AccuPrime Pfx DNA pol; (C) Pfx50 DNA pol; (D) Pfu DNA pol; (E) Deep Vent DNA pol (exo - ); (F) TITANIUM Taq DNA pol. 
Sequencing reactions were performed in the presence of dPa'TP (left panels) or ddPa'TP (right panels) in each figure. The blue arrow indicates 
the original unnatural base position. The percentages indicated in the right panels for Ds-temp 1 and Ds-temp 4 are the retention rates of the Ds-Px 
pair in the amplified products after 15 cycles of PCR. The details of the calculation method for the retention rates are provided in the Supplementary 
Data section. 



samples. We increased the determination accuracy of the 
retention rates by improving the peak height quantifica- 
tion (see Supplementary Figures SI and S2). 

Using the unnatural base substrate combination of 
dDsTP and NH 2 -hx-dPxTP, we screened 1 1 DNA poly- 
merases, the Deep Vent (exo + ), Deep Vent (exo - ), 
AccuPrime Pfx, Pfx50, Phusion HF, Pfu, Pwo SY, Vent 
(exo + ), N9° DNA pols from family B, and the 
TITANIUM Taq and Taq DNA polymerases from 
family A, by 15 cycles of PCR with 50 uM dDsTP and 
NH 2 -hx-dPxTP and 300 uM natural dNTPs. 
Representative gel and sequencing patterns of the PCR 
products are shown in Figures 2 and 3, respectively, and 
all of the data for the amplification folds and the retention 
rates of the Ds-Px pair in the amplified DNA products are 
summarized in Supplementary Table SI and 
Supplementary Figures S3-S15. Although the sequencing 
patterns in the presence of dPa ; TP for Ds-temp 4 did not 
give a clear gap corresponding to the unnatural base 
position (30), the retention rates of the Ds-Px pair in 
the PCR products were accurately determined from the 
sequencing patterns in the presence of ddPa ; TP 
(Figure 3). In the PCR amplification of Ds- temps 1, 



2 and 3 using the family B polymerases, most of the 
Ds-Px pair (>97%) was retained in the PCR products, 
although the fold amplification depended on the polymer- 
ase. In this analysis, >97% of the retention rate is the 
maximum detection limit. In the PCR amplification of 
the unfavorable Ds-temp 4, the retention rates of the 
Ds-Px pair and the amplification folds varied with the 
polymerase, but most of them were lower than those of 
Ds- temps 1, 2 and 3. Among them, the Deep Vent (exo + ) 
(Figure 3A), AccuPrime Pfx (Figure 3B) and Pfx50 DNA 
pols (Figure 3C) exhibited high amplification folds and 
retention rates with all of the templates. In addition, 
increasing the concentration of the natural base substrates 
[dNTPs, 500 uM for Deep Vent (exo + ) and 400 uM for 
AccuPrime Pfx] improved the amplification folds, espe- 
cially those of Ds-temp 4, without a significant reduction 
in their retention rates. For example, in a 15-cycle PCR 
amplification by AccuPrime Pfx DNA pol with 50 uM 
dDsTP and NH 2 -hx-dPxTP and 400 uM natural dNTPs, 
the amplification folds were 1774 for Ds-temp 1 and 1503 
for the unfavorable Ds-temp 4, and were only slightly 
lower than that (1955-fold) for Cont-temp. Under the 
same conditions, the retention rates of the Ds-Px pair in 
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the amplified products were >97% for Ds-temp 1 and 
93 ± 2% for Ds-temp 4. The dependency of the amplifi- 
cation folds on the natural base substrate concentrations 
suggests that the natural base incorporation after the un- 
natural base pairing is one of the rate limiting steps, and 
thus the amplification efficiency could be improved by 
increasing the natural base substrate concentrations. 
Figure 2C and D also support this suggestion, and the 
amounts of the truncated products were significantly 
reduced by increasing the concentrations of the natural 
base substrates. 

Next, we examined the effect of the 3' 5' exonuclease 
activity of the DNA polymerases on the unnatural base 
pair PCR amplification, using Deep Vent DNA pols with 
(exo + ) and without the exonuclease activity (exo - ). Deep 
Vent DNA pol (exo - ) performed with relatively high effi- 
ciency (424-fold after 15 cycles of PCR) and selectivity 
(>97% retention rate of the unnatural base pair). 
However, the sequencing patterns in the presence of 
ddPa'TP of the PCR products using Ds-temp 1 and 
Cont-temp (Figure 3E) indicated a measurable problem 
with unnatural base misincorporation opposite the 
natural bases. As shown in the ddPa'TP sequencing 
using Ds-temp 1 and Cont-temp (Figure 3E), the peak 
heights corresponding to the natural bases gradually 
decreased, relative to those corresponding to the 
exonuclease-proficient pol (exo + ) (Figure 3A). During 
the sequencing reaction, ddPa'TP may have been 
incorporated opposite multiple Ds positions, which 
resulted from the unnatural base misincorporations in 
place of the natural bases during PCR amplification, 
and the sequencing reaction was paused, depending on 
the unnatural base misincorporation rate by the 
ddPaTP incorporation opposite Ds. Therefore, the 
y -> 5' exonuclease activity of polymerases is important 
to remove the unnatural base substrates misincorporated 
opposite the natural bases during PCR. 

We also examined PCR amplification using family A 
(Taq and TITANIUM Taq) DNA pols. The Taq and 
TITANIUM Taq DNA pols, which have intrinsically no 
y 5' exonuclease activity, were ineffective, even in 
PCRs using Ds- temps 1 and 2 (Figure 3F and 
Supplementary Table SI). The selectivity of the Ds-Px 
pairing was much lower than that using the family B 
DNA pols. However, based on the sequencing pattern in 
the presence of ddPa'TP (Figure 3F), no gradual attenu- 
ation of each peak was observed, as compared to the 
pattern from the template obtained using Deep Vent 
DNA pol (exo - ) (Figure 3E). Thus, the misincorporation 
rates of the unnatural base substrates opposite the natural 
bases are not very high when using Taq and TITANIUM 
Taq DNA pols. Indeed, our recent study showed that 
TITANIUM Taq DNA pol exhibited high selectivity 
and efficiency in a one-way incorporation of modified 
dPxTP opposite Ds in templates, and is useful for the 
unnatural base incorporation in a primer region in PCR 
(36). These results suggest that the TITANIUM Taq and 
Taq DNA pols efficiently incorporate dPxTP into DNA 
opposite Ds, but are not useful to incorporate dDsTP into 
DNA opposite Px. 



Sequence dependence of the PCR amplification efficiency 

We determined the exact amplification efficiency using the 
less effective template, Ds-temp 4, in comparison to those 
using Ds-temp 1 and Cont-temp, by real-time PCR. Four 
10-fold dilutions (20fM-20pM) of each template were 
amplified by Deep Vent DNA pol (exo + ) with 50 uM 
dDsTP and NH 2 -hx-dPxTP and 300 uM natural dNTPs, 
and the amplified products were monitored by the conven- 
tional SYBR Green dye detection system with a real-time 
PCR instrument (Figure 4A-C). Figure 4D shows the 
standard curves of the PCR amplification using each 
template, from which the amplification efficiencies 
( = [10 ( - 1/slope) ]-l) of the PCRs using Ds-temp 1, 
Ds-temp 4 and Cont-temp [0.91, 0.80 and 1.04 (-1.0), re- 
spectively] were determined. Although there are some dif- 
ferences among the template sequence contexts with or 
without the unnatural base, the values (0.80-0.91) of the 
amplification efficiency using the Ds-containing templates 
would be acceptable for the practical use of PCR involving 
the Ds-Px pair. Indeed, after 40 cycles of PCR starting 
from single-stranded DNA templates, the DNA fragments 
(as double strands) would theoretically be amplified 
9.0 x 10 9 -fold (= 1.80 39 ) for 0.80 and 9.1 x 10 10 -fold for 
0.91, which coincide well with the experimental data, as 
discussed later (Table 1). 

We also found that the PCR amplification with 
Ds-temp 4 took a longer time to reach the exponential 
phase than those with Ds-temp 1 and Cont-temp 
(Figure 4). The j-intercept values in the standard curves 
were significantly different among the templates (41.73 
using Ds-temp 1, 46.78 using Ds-temp 4 and 37.46 using 
Cont-temp) (Figure 4D). The Cont-temp value (37.46) is 
reasonable, relative to that in conventional real-time PCR 
detection, but the values (41.73 and 46.78) obtained using 
Ds-containing templates, especially Ds-temp 4, are rela- 
tively high. In general, the high j-intercept values result 
from the degradation of templates during PCR, but we 
confirmed the stability of the Ds-containing DNA tem- 
plates under the PCR conditions. A plausible possibility 
is that the complex of the polymerase and the Ds-contain- 
ing template, especially Ds-temp 4, may dissociate at low 
template concentrations. Therefore, increasing the initial 
concentration of the DNA template, as well as the sub- 
strate concentrations as mentioned earlier, might increase 
the fold amplification when using an unfavorable sequence 
context, such as Ds-temp 4. 

100 cycles of PCR involving the Ds-Px pairing 

Using the optimized PCR conditions, we performed 100 
cycles of PCR amplification of Ds-temp 1 with dDsTP and 
NH 2 -hx-dPxTP by the Deep Vent (exo + ) and AccuPrime 
Pfx DNA pols, to examine the abilities and limitations of 
the Ds-Px pair in PCR, as well as to increase the 
maximum detection limits (>97%) of the unnatural base 
pair retention rates in the amplified products. To maintain 
the exponential amplification through 100 cycles of PCR, 
we repeated the process of a 10-cycle PCR and the dilution 
of the PCR solution 10 times. Ds-temp 1 (50 fmol) was 
amplified by 10 cycles of PCR with 30 pM dDsTP and 
NH 2 -hx-dPxTP and 300 (iM natural dNTPs with Deep 
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Figure 4. Real-time quantitative PCR amplification plots of each DNA template [(A) Ds-temp 1, (B) Ds-temp 4, (C) Cont-temp] and their linear 
standard curve analysis (D). PCR was performed with Deep Vent DNA pol (exo + ) in the presence of 50jiM dDsTP and NH 2 -hx-dPxTP, 300 jiM 
natural dNTPs, and different concentrations of DNA templates (20 fM, 200 fM, 2pM, 20 pM). 



Table 1. Amplification efficiencies and fidelities of the Ds-Px pairing in 40 cycles of PCR in different template sequence contexts, using dDsTP 
and NH 2 -hx-dPxTP 



DNA 


DNA 


Natural 


Unnatural 


Amplification 
fold c 


Amplification 


Selectivity 11 


Misincorporation 
rate 6 


polymerase 


template 


substrates a 


substrates b 


efficiency 


(% /replication) 






(pM) 


OiM) 










(% /bp/replication) 


Deep Vent (exo + ) 


Ds-temp 1 


300 


30 


9.6 x 


10 10 


0.91 


99.92 










40 


9.0 x 


10 10 


0.91 


99.92 










50 


6.7 x 


10 10 


0.89 


99.92 






Ds-temp 3 




30 


6.6 x 


10 10 


0.89 


99.92 










50 


4.8 x 


10 10 


0.88 


99.91 






Ds-temp 4 




30 


1.4 x 


10 10 


0.82 


99.61 










50 


8.4 x 


10 9 


0.80 


99.75 






Cont-temp 




30 


5.7 x 


10 10 


0.89 




0.007 








40 


9.4 x 


10 10 


0.91 




0.010 








50 


7.2 x 


10 10 


0.90 




0.012 


AccuPrime Pfx 


Ds-temp 1 


400 


30 


7.7 x 


10 10 


0.90 


99.74 










50 


1.0 x 


10 11 


0.91 


99.88 






Ds-temp 3 




30 


1.0 x 


10 11 


0.91 


99.72 










50 


7.7 x 


10 10 


0.90 


99.89 
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30 


1.7 x 


10 10 


0.83 


98.94 










50 


1.2 x 


10 10 


0.81 


99.22 






Cont-temp 




30 


1.0 x 


10 11 


0.91 




0.011 








40 


1.4 x 


10 11 


0.93 




0.016 








50 


8.0 x 


10 10 


0.90 




0.023 






600 


50 


1.0 x 


10 11 


0.91 




0.017 



Concentration of each natural base substrate (dNTPs), Concentration of dDsTP and NH 2 -hx-dPxTP, Amplification folds after 40 cycles of PCR, 
Selectivity of the unnatural base pairing, e Misincorporation rate of the unnatural base substrates opposite the natural bases in templates. 
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Vent DNA pol (exo + ) or with 50 |nM dDsTP and NH 2 - 
hx-dPxTP and 400 uM natural dNTPs with AccuPrime 
Pfx DNA pol, and the PCR solution was diluted 
750-fold for the following amplification (Figure 5A). 
From the gel electrophoresis of the products after 100 
cycles of PCR (Figure 5C), Ds-temp 1 was amplified 
5 x 10 27 -fold by Deep Vent DNA pol (exo + ) and 
2 x 10 28 -fold by AccuPrime Pfx DNA pol. The position 
of the Ds-Px pair in the amplified products was confirmed 
by sequencing in the presence of dPa'TP (left panels in 
Figure 5D and E). The retention rates of the Ds-Px pair 
in the amplified products were determined by sequencing 
in the presence of ddPa'TP (right panels in Figure 5D and 
E), and for the Ds-Px pair, >97% for Deep Vent DNA 
pol (exo + ) and 88.4% for AccuPrime Pfx DNA pol were 
retained in the amplified DNA after 100 cycles of PCR. 
The selectivities of the Ds-Px pairing were determined 
from the amplification folds and the retention rates, and 
were as high as 99.97 and 99. 87% /replication for Deep 
Vent DNA pol (exo + ) and AccuPrime Pfx DNA pol, 
respectively. 

Another important consideration is the 
misincorporation rates of the unnatural base substrates 
opposite the natural bases in templates during PCR amp- 
lification. To determine the misincorporation rates, we de- 
veloped another method (Figure 5B), in which the 
misincorporated PCR products were specifically labeled 
by a second PCR with fluorescent dye-linked dPxTP. 
The natural base template, Cont-temp, was amplified by 
1 00 cycles of PCR in the presence of the examined unnat- 
ural base substrates, and the products were again 
amplified by 7 cycles of the second PCR with another 
combination of fluorescent dye-attached unnatural base 
substrates, Cy5-hx-dPxTP (36) (Figure 1) and dDsTP, 
using a fluorescein (FAM)-labeled primer. The amplified 
products were then analyzed by Cy5 and FAM double 
detection on the gel. The fluorescence of Cy5 represents 
the replacement of the misincorporated unnatural bases 
(see Figure 5G), and the fluorescence of FAM corresponds 
to the total amplified DNA fragments (see Figure 5F). 
During the 7-cycle PCR with dDsTP and Cy5-hx- 
dPxTP, these unnatural base substrates are also slightly 
misincorporated opposite the natural bases, and thus the 
initial DNA template was also amplified by the second 
PCR with dDsTP and Cy5-hx-dPxTP (see the 
Cont-temp lane in Figure 5G), to subtract 
the misincorporation rate of Cy5-hx-Px from those after 
the first and second PCR amplifications. 

Figure 5F and G show the gel electrophoreses for 
determining the misincorporation rates of unnatural 
base substrates opposite the natural bases after 10-, 20-, 
40-, 70- and 100-cycles of the first PCR using 30 uM 
dDsTP and NH 2 -hx-dPxTP and 300 uM natural dNTPs 
by Deep Vent DNA pol (exo + ), and the data including 
the misincorporation rates (% in total amplified 
products, %/bp in total amplified products, %/bp/replica- 
tion) are summarized in Figure 5H. After 100 cycles of 
PCR, Cont-temp was amplified 8 x 10 27 -fold. Although 
6.85% of the amplified DNA fragments contained the un- 
natural base pairs, resulting from the unnatural base 
misincorporation, after 100 cycles of PCR, the 



misincorporation rate was only 0.0067 % /bp/replication. 
This misincorporation rate corresponds to an error rate 
of 6.7 x 10 -5 error/bp, which is very close to the intrinsic 
error rate among the natural bases by Deep Vent DNA 
pol (exo + ) (~2 x 10 _5 error/bp, from the New England 
BioLabs information). 

Fidelity of the Ds-Px pairing in 40 cycles of PCR 

Next, we determined the fidelity of both the selectivity of 
the Ds-Px pairing and the misincorporation rate of the 
unnatural base substrates opposite the natural bases. We 
performed 40 cycles of PCR (10 cycles repeated four 
times) using various templates, Ds-temps 1, 3 and 4 and 
Cont-temp, with different concentrations of dDsTP, NH 2 - 
hx-dPxTP, the natural dNTPs, and the Deep Vent (exo + ) 
and AccuPrime Pfx DNA pols. The amplification 
folds and efficiencies, the selectivities of the Ds-Px 
pairing, and the misincorporation rates under several 
PCR conditions are summarized in Table 1. When using 
50 uM dDsTP and NH 2 -hx-dPxTP and 300 uM natural 
dNTPs, the PCR amplification folds of each template by 
Deep Vent DNA pol (exo + ) were relatively high 
(6.7 x 10 10 -fold for Ds-temp 1, 4.8 x 10 10 -fold for Ds- 
temp 3 and 8.4 x 10 9 -fold for Ds-temp 4), and from the 
amplification folds, the amplification efficiencies [amplifi- 
cation fold = (1 + amplification efficiency)", where 
n = PCR cycles — 1 for single-stranded templates] were 
estimated as 0.89 for Ds-temp 1, 0.88 for Ds-temp 3 and 
0.80 for Ds-temp 4. These efficiencies closely agreed with 
those (0.91 for Ds-temp 1 and 0.80 for Ds-temp 4) 
obtained by the real-time PCR experiments as mentioned 
earlier, confirming the accuracy of our determination 
methods. Furthermore, reducing the concentration of 
the unnatural base substrates, from 50 to 30 uM each, 
improved both the amplification folds and 
misincorporation rates without any significant loss of the 
Ds-Px pairing selectivity. The improved amplification ef- 
ficiency might be a consequence of the reduced 
misincorporation rates opposite the natural bases. In 
PCR with 30 uM dDsTP, NH 2 -hx-dPxTP and 300 uM 
natural dNTPs for all template sequence contexts, the 
amplification efficiency, the selectivity and the 
misincorporation rate were 0.82-0.91 and 99.61-99.92%/ 
replication, and 0.007%/bp/replication, respectively. The 
PCR amplification by the AccuPrime Pfx DNA pol was 
less selective, as compared to that by the Deep Vent DNA 
pol. However, it exhibited the highest amplification effi- 
ciency among the examined polymerases, and thus 
decreasing the number of PCR cycles could compensate 
for the selectivity loss when using the AccuPrime Pfx 
DNA pol. 

Side-chain effects of the Px base on the Ds-Px pair 
fidelity 

During the extensive experiments of PCR amplification 
using modified Px base substrates for generating new 
functional DNA molecules, we found that modifications 
of the Px base positively affected the fidelity of the Ds-Px 
pairing in PCR. One of the advantages of the Ds-Px pair 
is the capability of the modified Px base, as we used 
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Figure 5. Amplification efficiency and fidelity assessments of the Ds-Px pairing by 40-cycle and 100-cycle PCR amplifications (repeated 10-cycle 
PCR). (A) Scheme of the 40- or 100-cycle PCR amplifications of the Ds-containing DNA templates using dDsTP and modified-dPxTP, for the 
determination of the fold amplification, efficiency and selectivity of the Ds-Px pairing in PCR. (B) Scheme of the 40- or 100-cycle PCR amplification 
of the DNA template comprising only the natural bases using dDsTP and modified-dPxTP, for the determination of the misincorporation rates of 
the unnatural base substrates opposite the natural bases. The results are summarized in Tables 1 and 2. (C) Gel electrophoresis of the amplification 
products after each 10-cycle PCR (total 100 cycles of PCR) of Ds-temp 1 by the Deep Vent DNA pol (exo + ) using 1 uM each primer, 30 uM dDsTP 
and NH 2 -hx-dPxTP and 300 uM natural dNTPs. A portion of each 10-PCR solution before (lanes T) and after each 10-cycle PCR (lanes +) was 
analyzed by 15% polyacrylamide denaturing gel electrophoresis, followed by SYBR Green II staining. M: marker (75-mer single-stranded DNA). 
(D and E) Sequencing analysis of the PCR products after 10, 40, 70 and 100 cycles of PCR by Deep Vent DNA pol (exo + ) (D) and AccuPrime Pfx 
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Figure 5. Continued 

DNA pol (E). Sequencing reactions were performed in the presence of dPa'TP (left panels) or ddPa'TP (right panels). The blue arrow indicates the 
original unnatural base position. The percentage indicated in the right panels is the retention rate of the Ds-Px pair in the amplified products. The 
method for the calculation of the retention rates is provided in the Supplementary Data section. (F-H) Determination of misincorporation rates of 
the unnatural base substrates opposite the natural bases in Cont-temp after 10, 20, 40, 70 and 100 cycles of PCR by the Deep Vent DNA pol (exo + ) 
using 30 uM dDsTP and NH 2 -hx-dPxTP and 300 uM natural dNTPs (first PCR). The second PCR products with the FAM-labeled 3 / -primer and 
Cy5-hx-dPxTP were analyzed by 15% denaturing PAGE, and the DNA fragments were detected with FAM fluorescence (F) for the quantification of 
the total PCR products and with Cy5 fluorescence (G) for the quantification of the unnatural base misincorporated products. (H) Misincorporation 
rates calculated from the data are summarized in the panel. 



Cy5-hx-dPxTP to site- specifically incorporate it into 
DNA opposite Ds. A variety of functional groups can 
be attached to the Px base via the propynyl linker, and 
these modified dPxTPs are also site-specifically 
incorporated opposite Ds in replication. We synthesized 
several modified Px substrates by attaching amino (NH 2 - 
Cl- and NH 2 -hx-dPxTPs), diol (Dioll- and Diol3o3- 
dPxTPs), and aromatic groups (DMP-, NTP-, BPh- and 
HBP-dPxTPs) with different linkers, as well as the 
non-modified dPxTP (Figure 1, see Supplementary Data 
section for the syntheses). The amplification efficiency and 
fidelity of the Ds-Px pairing using these modified dPxTP 
were assessed by 40 cycles of PCR with 50 uM dDsTP and 
modified dPxTP, 300 uM natural dNTPs, DNA templates 
(Ds-temps 1 and 4 and Cont-temp) and the Deep Vent 
DNA pol (exo + ). 

Table 2 shows the amplification fold and efficiency, the 
selectivity of the Ds-Px pairing, and the misincorporation 
rate for PCR amplification using each unnatural base 
pair combination of modified dPxTP and dDsTP. 
Most of the modified dPxTPs, except for the biphenyl 
modifications (BPh- and HBP-dPxTPs), exhibited rela- 
tively high efficiency (0.79-0.91, around 10 10 -fold amplifi- 
cation after 40 cycles of PCR) and high selectivity 
(>99.9%/replication) when using Ds-temp 1. In the 
PCR amplification of Ds-temp 4, the efficiency and 



selectivity with all of the modified dPxTPs were decreased 
but were still as high as 0.75-0.80 and 99.22-99.87%/rep- 
lication, respectively. 

The biphenyl modifications (BPh- and HBP-dPxTPs) 
showed a unique sequence context dependency (Table 2). 
Although the amplification efficiency and selectivity 
were lower than those of the other modified Px substrates, 
these modifications worked better with Ds-temp 
4 than Ds-temp 1 in PCR, even by the Deep Vent DNA 
pol (exo + ). For example, in PCR using HBP-dPxTPs and 
dDsTP, the amplification efficiencies were 0.68 for 
Ds-temp 1 and 0.71 for Ds-temp 4, and the selectivities 
of the Ds-Px pairing were 98.97% for Ds-temp 1 and 
99.83% for Ds-temp 4, with a relatively low 
misincorporation rate opposite the natural bases 
(0.008% /bp/replication) . 

We also found that Px modifications improved the 
fidelity of the Ds-Px pairing in PCR. As compared with 
the misincorporation rate using dPxTP (0.0 19 %/bp/repli- 
cation), all of the Px modifications reduced the 
misincorporation rates (0.003-0.0 16%/bp/replication). 
Among the modified dPxTPs, the selectivity of the Ds- 
Px pairing was less correlated with the misincorporation 
rate, and the diol-modified dPxTPs showed the lowest 
misincorporation rate (0.003-0.005 %/bp/replication) 
even when using high concentrations (50 uM) of the 
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Table 2. Amplification efficiencies and fidelities of the Ds-Px pairing in 40 cycles of PCR in different template sequence contexts using several 
combinations of dDsTP and modified dPxTP by Deep Vent DNA pol (exo + ) 
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Amplification folds after 40 cycles of PCR using 50 fiM dDsTP, modified-dPxTP and 300 jiM natural base dNTPs, b Selectivity of the unnatural base 
pairing, c Misincorporation rate of the unnatural base substrates opposite the natural bases in templates. 



unnatural base substrates, without any significant reduc- 
tions in the amplification fold and the selectivity of the 
Ds-Px pairing. From the overall evaluation among these 
modified dPxTPs, the combination of Dioll-dPxTP and 
dDsTP was the best for PCR amplification, and exhibited 
the highest amplification efficiency (0.79-0.89) and select- 
ivity (99. 77-99. 92 %/replication) among all of the tem- 
plates, and the lowest misincorporation rate (0.005%/bp/ 
replications 5 x 10 -5 error/bp of error rate) opposite the 
natural base templates. These results suggest that the 
modification of unnatural bases protruding into the 
major groove side also affects the fidelity of the unnatural 
base pairing in replication. 



DISCUSSION 

We fine-turned the PCR conditions involving the Ds-Px 
pair as a third base pair and developed methods for as- 
sessing the amplification efficiency and fidelity of the un- 
natural base pairing. In the methods, we increased the 
accuracy of the determination of the PCR amplification 
efficiency and the fidelity of the Ds-Px pair by maintain- 
ing exponential amplification conditions, by repeating 
10-cycle PCRs. The best combination for the Ds-Px pair 
PCR system is dDsTP and Dioll-dPxTP. Under the 
optimized conditions using 50 uM dDsTP and 
Dioll-dPxTP and 300 (iM natural dNTPs by Deep Vent 



(exo + ) pol, DNA fragments with different sequence 
contexts, except for purine-Ds-purine motifs, were 
amplified M0 10 -fold (amplification efficiency = 0.89) by 
40 cycles of PCR and the selectivity of the cognate 
Ds-Px pairing was > 99. 9 %/replication. Even in PCR 
with unfavorable sequence contexts containing purine- 
Ds-purine, the unnatural base pair selectivity was 
99. 77 %/replication, with high amplification efficiency 
(0.79) similar to those of other sequence contexts. The 
diol modification of the Px base reduced the 
misincorporation rate of the unnatural base substrates 
opposite the natural bases in template, which was 
0.005% /bp/replication. The low misincorporation rate of 
the Ds and Dioll-Px substrates (0. 00 5% /bp/replica- 
tion = 5 x 10 -5 error/bp) is very close to the intrinsic 
mispairing error rate among the natural bases 
(~2 x 10~ 5 error/bp) of Deep Vent DNA pol (exo + ). 
Therefore, the ability of the unnatural base pairing is ap- 
proaching those of the natural A-T and G-C pairings. 

The Ds-Px pair in combination with dDsTP and NH 2 - 
hx-dPxTP also exhibited high efficiency and fidelity in 
PCR amplification using the Deep Vent (exo + ) and 
AccuPrime Pfx DNA pols. The selectivities in PCR with 
Ds-temp 1 using 30 pM dDsTP and NH 2 -hx-dPxTP and 
300 uM natural dNTPs were 99.92%/replication for Deep 
Vent (exo + ) pol and 99.74% for AccuPrime Pfx DNA pol. 
Under the same conditions, the misincorporation rates 
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were 0.007%/bp/replication for Deep Vent (exo + ) pol and 
0.011%/bp/replication for AccuPrime Pfx DNA pol. We 
demonstrated the great potential of the Ds-Px pair by 
performing 100 cycles of PCR using dDsTP and NH 2 - 
hx-dPxTP. Even in PCR using the unfavorable template 
sequence context by Deep Vent (exo + ) pol, the amplifica- 
tion efficiency (0.82 for Ds-temp 4) and the selectivity 
(99. 6% /replication for Ds-temp 4) adequately reached 
practical levels for PCR. Although the PCR amplification 
by the AccuPrime Pfx DNA pol was less selective than 
that by Deep Vent (exo + ), the higher amplification effi- 
ciency is compensated by the shorter PCR cycles. In 
addition, the AccuPrime Pfx DNA pol might be useful 
for PCR amplification of DNA templates containing 
multiple unnatural bases. Furthermore, a variety of 
modified Px substrates can also be used for the Ds-Px 
pair PCR system, enabling the site-specific 
functionalization of DNA molecules. 

The present analyses of PCR involving the Ds-Px pair 
suggest several points for the further development of un- 
natural base pair systems. Some of the points might also 
offer suggestions for the mechanisms of the natural base 
pairing in replication and the development of natural base 
pair PCR systems with high efficiency and fidelity, (i) The 
main role of the 3' —> 5 f exonuclease activity of Deep Vent 
DNA pol is to prevent the misincorporation of the unnat- 
ural base substrates opposite the natural bases. Thus, the 
polymerase proofreading mechanisms function for the rec- 
ognition of non-cognate pairs between the natural and 
unnatural bases. In addition, mixed polymerases with 
and without exonuclease activity might improve the un- 
natural base pair efficiency and selectivity. Indeed, 
AccuPrime Pfx DNA pol contains two types of polymer- 
ases, (ii) One of the rate limiting steps is the natural base 
incorporation opposite its complementary base after the 
unnatural base pair, as well as the unnatural base pairing. 
Thus, increasing the concentration of the natural base 
substrates could improve the amplification efficiency 
without any loss of fidelity in PCR. (iii) The real-time 
PCR analyses suggested that the amplification fold is 
also affected by the initial template concentration, and a 
higher template concentration could increase the fold 
amplification with fewer PCR cycles. The less effective 
purine-Ds-purine sequences might reduce the stability of 
the polymerase and substrate complex, and thus 
increasing the concentration of each component is import- 
ant for efficient PCR amplification for all template 
sequence contexts and multiple incorporations of the un- 
natural base pairs, (iv) Extra side chains attached to un- 
natural bases affect the fidelity of the unnatural base 
pairing. Among the side chain functional groups, some 
of them increase the unnatural base pair fidelity and 
others change the polymerase's preference for the 
sequence contexts. 

Our studies of the Ds-Px pair in PCR have confirmed 
the utility of a third base pair as a replicative component 
with high efficiency and fidelity, besides the natural A-T 
and G-C base pairs, and further development of the un- 
natural base pair systems and applications to new 
biotechnologies are expected in the future. 
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